Analysis of L2 English speech corpus by automatic phoneme alignment
نویسندگان
چکیده
This study tested the application of adapted HTK for automatic alignment of speech corpus of Asian speakers’ English. The HTK tool with TIMIT has problems in aligning non-native speakers’ English. New sets of phoneme sequences for each word were listed to test if an adapted alignment module could accurately analyze pronunciation of Japanese speakers’ English. The new sets of phoneme sequences produced better alignment of Japanese accented English and showed that the L2 incorporated new alignment module could perform more accurate automatic alignment of L2 English data. The same methods should be able to be applied to other language data.
منابع مشابه
Allophone-based acoustic modeling for Persian phoneme recognition
Phoneme recognition is one of the fundamental phases of automatic speech recognition. Coarticulation which refers to the integration of sounds, is one of the important obstacles in phoneme recognition. In other words, each phone is influenced and changed by the characteristics of its neighbor phones, and coarticulation is responsible for most of these changes. The idea of modeling the effects o...
متن کاملTranscription and annotation of a Japanese accented spoken corpus of L 2 Spanish for the development of CAPT applications
This paper addresses the process of transcribing and annotating spontaneous non-native speech with the aim of compiling a training corpus for the development of Computer Assisted Pronunciation Training (CAPT) applications, enhanced with Automatic Speech Recognition (ASR) technology. To better adapt ASR technology to CAPT tools, the recognition systems must be trained with non-native corpora tra...
متن کاملAutomatic alignment of phonetic segments
Speech data transcribed at the phoneme level is important for basic speech technology applications. This paper describes some experiments with an automatic method for aligning a given sequence of phonemes with the corresponding spoken utterance. It is shown that methods borrowed from the field of automatic speech recognition can successfully be adapted to this problem. Results are reported for ...
متن کاملStatistical corpus-based speech segmentation
An automatic speech segmentation technique is presented that is based on the alignment of a target speech signal with a set of different reference speech signals generated by a specific designed corpus-based speech synthesis system that additionally generates phoneme boundary markers. Each reference signal is then warped to the target speech signal. By synthesizing and warping many different re...
متن کاملSPeech Phonetization Alignment and Syllabification (SPPAS): a tool for the automatic analysis of speech prosody
SPASS, SPeech Phonetization Alignment and Syllabification, is a tool to automatically produce annotations which include utterance, word, syllable and phoneme segmentations from a recorded speech sound and its transcription. SPPAS is currently implemented for French, English, Italian and Chinese and there is a very simple procedure to add other languages. The tool is developed for Unix based pla...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011